1 iDistance: An adaptive B+-tree based indexing method for nearest neighbor search

H. V. Jagadish, Beng Chin Ooi, Kian-Lee Tan, Cui Yu, Rui Zhang 

June 2005 ACM Transactions on Database Systems (TODS), volume 30 issue 2 

Publisher: ACM Press 

Full text available: pdf(1.16 MB) Additional Information: full citation, abstract, references, index terms

In this article, we present an efficient B+-tree based indexing method, called iDistance,
for K-nearest neighbor (KNN) search in a high-dimensional metric space. iDistance 
partitions the data based on a space- or data-partitioning strategy, and selects a 
reference point for each partition. The data points in each partition are transformed into a 
single dimensional value based on their similarity with respect to the reference point. This 
allows the points to be indexed using a B ...

Keywords: Indexing, KNN, nearest neighbor queries



2 Image-based reconstruction of spatial appearance and geometric detail

Hendrik P. A. Lensch, Jan Kautz, Michael Goesele, Wolfgang Heidrich, Hans-Peter Seidel 
April 2003 ACM Transactions on Graphics (TOG), volume 22 issue 2 
Publisher: ACM Press 

Full text available: pdf(302.22 KB) Additional Information: full citation, abstract, references, citings, index terms

Real-world objects are usually composed of a number of different materials that often 
show subtle changes even within a single material. Photorealistic rendering of such 
objects requires accurate measurements of the reflection properties of each material, as 
well as the spatially varying effects. We present an image-based measuring method that 
robustly detects the different materials of real objects and fits an average bidirectional 
reflectance distribution function (BRDF) to each of them. In or ... 

Keywords: BRDF measurement, normal map acquisition, photometric stereo, shape from 
shading, spatially varying BRDFs 



3 Contrast Plots and P-Sphere Trees: Space vs. Time in Nearest Neighbour Searches
Jonathan Goldstein, Raghu Ramakrishnan 

September 2000 Proceedings of the 26th International Conference on Very Large Data
Bases VLDB '00

Publisher: Morgan Kaufmann Publishers Inc. 



4 Predictive performance and scalability modeling of a large-scale application

D. J. Kerbyson, H. J. Alme, A. Hoisie, F. Petrini, H. J. Wasserman, M. Gittings

November 2001 Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM)
Publisher: ACM Press 

Full text available: pdf(187.50 KB) Additional Information: full citation, abstract, references, citings, index terms

In this work we present a predictive analytical model that encompasses the performance 
and scaling characteristics of an important ASCI application. SAGE (SAIC's Adaptive Grid 
Eulerian hydrocode) is a multidimensional hydrodynamics code with adaptive mesh 
refinement. The model is validated against measurements on several systems including 
ASCI Blue Mountain, ASCI White, and a Compaq AlphaServer ES45 system showing high 
accuracy. It is parametric --- basic machine performance numbers (latency, MF ... 

Keywords: Performance analysis, Teraflop scale computing, full application codes, 
parallel system architecture 



5 Algorithms and applications for answering ranked queries using ranked views
Vagelis Hristidis, Yannis Papakonstantinou 

January 2004 The VLDB Journal — The International Journal on Very Large Data 
Bases, volume 13 issue 1
Publisher: Springer-Verlag New York, Inc. 

Full text available: pdf(478.90 KB) Additional Information: full citation, abstract, index terms

Ranked queries return the top objects of a database according to a preference function. 
We present and evaluate (experimentally and theoretically) a core algorithm that answers 
ranked queries in an efficient pipelined manner using materialized ranked views. We use 
and extend the core algorithm in the described PREFER and MERGE systems. PREFER 
precomputes a set of materialized views that provide guaranteed query performance. We 
present an algorithm that selects a near optimal set of views under s ... 

Keywords: Materialization, Merge ranked views, Ranked queries 



6 Database session 1: querying high-dimensional data: Approximate searches: k-neighbors + precision

Sid-Ahmed Berrani, Laurent Amsaleg, Patrick Gros

November 2003 Proceedings of the twelfth international conference on Information and knowledge management
Publisher: ACM Press 

Full text available: pdf(154.57 KB) Additional Information: full citation, abstract, references, citings, index terms

It is known that all multi-dimensional index structures fail to accelerate content-based 
similarity searches when the feature vectors describing images are high-dimensional. It is 
possible to circumvent this problem by relying on approximate search-schemes trading-off 
result quality for reduced query execution time. Most approximate schemes, however, 
provide none or only complex control on the precision of the searches, especially when 
retrieving the k nearest neighbors (NNs) of query poi ... 

Keywords: approximate nearest-neighbor searches, multimedia databases, similarity 
searches 



7 Searching in metric spaces with user-defined and approximate distances
Paolo Ciaccia, Marco Patella 

December 2002 ACM Transactions on Database Systems (TODS), volume 27 issue 4 
Publisher: ACM Press 

Full text available: pdf(555.89 KB) Additional Information: full citation, abstract, references, citings, index terms

Novel database applications, such as multimedia, data mining, e-commerce, and many 
others, make intensive use of similarity queries in order to retrieve the objects that better 
fit a user request. Since the effectiveness of such queries improves when the user is 
allowed to personalize the similarity criterion according to which database objects are 
evaluated and ranked, the development of access methods able to efficiently support 
user-defined similarity queries becomes a basic requirement. In t ... 

Keywords: Distance metrics, user-defined queries 



8 Fast subsequence matching in time-series databases

Christos Faloutsos, M. Ranganathan, Yannis Manolopoulos 

May 1994 ACM SIGMOD Record, Proceedings of the 1994 ACM SIGMOD international
conference on Management of data SIGMOD '94, volume 23 issue 2
Publisher: ACM Press 

Full text available: pdf(1.01 MB) Additional Information: full citation, abstract, references, citings, index terms

We present an efficient indexing method to locate 1-dimensional subsequences within a 
collection of sequences, such that the subsequences match a given (query) pattern within 
a specified tolerance. The idea is to map each data sequence into a small set of
multidimensional rectangles in feature space. Then, these rectangles can be readily 
indexed using traditional spatial access methods, like the R*-tree [9]. In more detail, we 
use a sliding window over the data sequence and extract its fea ... 



9 Database theory, technology and applications (DTTA): Accelerating approximate
similarity queries using genetic algorithms 
Renato Bueno, Agma J. M. Traina, Caetano Traina 

March 2005 Proceedings of the 2005 ACM symposium on Applied computing SAC '05 
Publisher: ACM Press 

Full text available: pdf(276.17 KB) Additional Information: full citation, abstract, references, index terms

Searching for the exact answer to a similarity query is an expensive process considering 
computational resources, such as memory and processing time requirements. Moreover, 
comparison operations over multimedia data is even more expensive than over traditional 
data such as numbers and small character strings. Therefore, when comparing multimedia 
data, the comparison computations usually consider some properties extracted from the 
data elements. In this way, exact queries involving this kind of d ... 

Keywords: approximate queries, genetic algorithms, metric access methods, similarity 
queries 




10 Demonstration session 2: Identifying audio clips with RARE

Chris J. C. Burges, John C. Platt, Jonathan Goldstein

November 2003 Proceedings of the eleventh ACM international conference on 
Multimedia 
Publisher: ACM Press

Full text available: pdf(159.91 KB) Additional Information: full citation, abstract, references, index terms

In this paper, we describe RARE (Robust Audio Recognition Engine): a system for 
identifying audio streams and files. RARE can be used in a variety of applications: from 
enhancing the consumer listening experience to cleaning large audio databases. RARE was 
designed with two key qualities in mind: robustness to distortion of the audio, and lookup 
speed. RARE identifies audio clips in a stream against a database of 1/4 million songs in 
real time using approximately 10% CPU on an 850 MHz P3, and wi ... 

Keywords: audio fingerprinting, fast indexing, robust lookup 



11 Similarity queries I: Efficient similarity search and classification via rank aggregation
Ronald Fagin, Ravi Kumar, D. Sivakumar 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: pdf(198.89 KB) Additional Information: full citation, abstract, references, citings, index terms

We propose a novel approach to performing efficient similarity search and classification in 
high dimensional data. In this framework, the database elements are vectors in a 
Euclidean space. Given a query vector in the same space, the goal is to find elements of 
the database that are similar to the query. In our approach, a small number of 
independent "voters" rank the database elements based on similarity to the query. These 
rankings are then combined by a highly efficient aggregation algorithm. ... 